NaDiR: Naive Distributional Response Generation

نویسندگان

  • Gabriella Lapesa
  • Stefan Evert
چکیده

This paper describes NaDiR (Naive DIstributional Response generation), a corpus-based system that, from a set of word stimuli as an input, generates a response word relying on association strength and distributional similarity. NaDiR participated in the CogALex 2014 shared task on multiword associations (restricted systems track), operationalizing the task as a ranking problem: candidate words from a large vocabulary are ranked by their average association or similarity to a given set of stimuli. We also report on a number of experiments conducted on the shared task data, comparing first-order models (based on co-occurrence and statistical association) to second-order models (based on distributional similarity).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval

The naive Bayes classiier, currently experiencing a renaissance in machine learning, has long been a core technique in information retrieval. We review some of the variations of naive Bayes models used for text retrieval and classiication, focusing on the distributional assumptions made about word occurrences in documents.

متن کامل

How to make words with vectors: Phrase generation in distributional semantics

We introduce the problem of generation in distributional semantics: Given a distributional vector representing some meaning, how can we generate the phrase that best expresses that meaning? We motivate this novel challenge on theoretical and practical grounds and propose a simple data-driven approach to the estimation of generation functions. We test this in a monolingual scenario (paraphrase g...

متن کامل

Maturation of Lymphocyte Immunophenotypes and Memory T Helper Cell Differentiation During Development in Mice

The goal of this study was to systematically investigate the ontogeny of lymphoid populations throughout postnatal development. In CD-1 mice, peak lymphocyte numbers occurred in blood on postnatal day 10 (d10) including those for natural killers (NK1.1), B cells (CD19), T helper (CD3CD4), naïve T helper (CD4CD62LposCD44low), memory T helper (CD4CD62LnegCD44high), and T cytotoxic (CD3CD8) cells....

متن کامل

Cortisol and epinephrine control opposing circadian rhythms in T cell subsets.

Pronounced circadian rhythms in numbers of circulating T cells reflect a systemic control of adaptive immunity whose mechanisms are obscure. Here, we show that circadian variations in T cell subpopulations in human blood are differentially regulated via release of cortisol and catecholamines. Within the CD4(+) and CD8(+) T cell subsets, naive cells show pronounced circadian rhythms with a dayti...

متن کامل

Bag-of-Embeddings for Text Classification

Words are central to text classification. It has been shown that simple Naive Bayes models with word and bigram features can give highly competitive accuracies when compared to more sophisticated models with part-of-speech, syntax and semantic features. Embeddings offer distributional features about words. We study a conceptually simple classification model by exploiting multiprototype word emb...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014